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Description 

[0001] The invention relates to newly identified polynucleotides, polypeptides encoded by them and to the use of 
such polynucleotides and polypeptides, and to their production. More particularly, the polynucleotides and polypeptides 
5 of the present invention relate to human carnosinase, hereinafter referred to as HC, and its use. The invention also 
relates to inhibiting or activating the action of such polynucleotides and polypeptides. 

[0002] Carnosine and related dipeptides such as anserine are naturally-occurring histidine-containing compounds. 
They are found in several tissues, most notably in muscle, where they represent an appreciable fraction of the total 
water-soluble nitrogen-containing compounds. The biological roles of these dipeptides are conjectural but they are 

10 believed to act as cytosolic buffering agents. Numerous studies have demonstrated, both at the tissue and organelle 
levels, that they possess strong and specific antioxidant properties. Carnosine and related dipeptides have been shown 
to prevent peroxidation of model membrane systems, leading to the suggestion that they represent water-soluble coun- 
terparts of lipid-soluble antioxidants, such as alpha-tocopherol, in protecting cell membranes from oxidative damage. 
Other roles ascribed to these dipeptides include actions as neurotransmitters, modulation of enzymatic activities and 

15 chelation of heavy metals. Many therapeutic actions of carnosine and histidine-containing dipeptides have been sug- 
gested, including antihypertensive effects, actions as immunomodulating agents, wound healing and antineoplastic 
effects (Mot Aspects Med 1 992; 1 3(5): 379-444). 

[0003] Carnosine and anserine are hydrolyzed in serum by the enzyme carnosinase. Carnosinase from human plas- 
ma has been purified (Clin Chim Acta 1991 Feb 15; 196(2-3): 193-205). Human serum carnosinase is found to be a 

20 . glycoprotein with an isoelectric point of 4.4 and a subunit molecular weight of 75,000; the active enzyme being a dimer 
and the two subunits being connected by one or more disulfide bonds. The enzyme is especially active in hydrolyzing 
carnosine and anserine, preferring dipeptides which contain histidine in the C-terminal position. Brain contains a high 
amount of carnosinase. The physiological function of this enzyme seems to be the hydrolysis of homocarnosine in the 
brain and the splitting of carnosine and anserine in the blood stream. 

25 [0004] Several reports describe variable phenotypes for a serum carnosinase deficiency, which range from normal 
to severe psychomotor retardation, hypotonia, and myoclonic seizures in the first year of life. A terminal deletion of 
chromosome 18 with breakpoint at q21.3 has been reported in the case of a 30-month-old girl with hypotonia, devel- 
opmental delays, and tremor. Serum carnosinase activity of this girl was found to be extremely low. These findings 
suggest that the locus for carnosinase resides on the distal long arm of chromosome 18 (Pediatric Res. 1997 Feb;41 

30 (2):210-3). 

[0005] Two unrelated children with a progressive neurologic disorder characterized by severe mental defect and 
myoclonic seizures have been described (Lancet 1: 1 229-1 230, 1 968). Both excreted carnosine in the urine, even when 
any source of the dipeptide is excluded from the diet. Both had unusually high concentrations of homocarnosine in the 
cerebrospinal fluid. It was suggested that one, and perhaps both, presented a defect in carnosinase activity, this enzyme 
35 being almost absent in the two patients. Autopsy on one of the boys (7-year old), whose parents had a low carnosinase 
activity, showed severe axonal degeneration, numerous 'spheroids' in the gray matter, demyelinization, fibrosis, and 
loss of Purkinje fibers (Neurology 22: 644-654, 1 972). 

[0006] Measurement of Human Serum Carnosinase may be useful in the assessment of patients with acute stroke 
with respect to diagnosis and prediction of clinical outcome (Stroke 1996 Nov;27(11):2064-8). 
40 [0007] Serum carnosinase activity is shown to be altered in patients with neurological disorders. Reduced serum 
carnosinase activity has been observed in patients with Parkinson's disease and multiple sclerosis compared with the 
control group. Carnosinase activity, 5-10% of that found in serum, was detected in cerebrospinal fluid (CSF) samples 
(Clin Chim Acta 1994 Feb;225(1):57-64). The cause of reduced serum carnosinase activity in central nervous system 
disorders remains unclear. 

45 [0008] An interesting finding suggests that in addition to glutamate decarboxylation, a second metabolic reaction for 
the formation of free GABA exists in specific neuronal tracts of the human CNS where GABA is released from homo- 
carnosine by the action of serum carnosinase (Brain Res. Bull. 1994;33(4):379-85). 

[0009] This indicates that human carnosinase has an established, proven history as therapeutic target and that it 
can play a role in preventing, ameliorating or correcting dysfunctions or diseases, including, but not limited to, cognitive 

so disorders, developmental abnormalities and fetal deficiencies, neurodegenerative disorders such as amyotrophic lat- 
eral sclerosis, Parkinson's disease, schizophrenia, abnormal mental states, ischemic shock, epilepsia. 
[0010] In one aspect, the invention relates to HC polypeptides and polynucleotides, and to recombinant materials 
and methods for their production. Another aspect of the invention relates to methods for using such HC polypeptides 
and polynucleotides, for example in the treatment of Alzheimer's disease and cognitive disorders, developmental ab- 

55 normalities and fetal deficiencies, neurodegenerative disorders such as amyotrophic lateral sclerosis, Parkinson's dis- 
ease, schizophrenia, abnormal mental states, ischemic shock, epilepsia, among others. 

[0011] In still another aspect, the invention relates to methods to identify HC agonists and antagonists using the 
materials provided by the invention, and treating conditions associated with HC. Yet another aspect of the invention 
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relates to diagnostic assays for detecting diseases associated with inappropriate HC activity or levels. 

[0012] Figure 1 is an histogram showing the relative abundance of the messenger RNA of HC in different human 

tissues. 

[0013] Figure 2 is an histogram of the micro-fluorescent measurement of activity of HC in transfected CHO cells. 
5 [0014] Figure 3 is an histogram showing the abundance of mRNA level of HC in patients with Alzheimer's disease 
(AD) compared with control patients. 

[0015] The following definitions are provided to facilitate understanding of certain terms used frequently herein. 
[0016] "HC" or "HC polypeptide" refers generally to a polypeptide having the amino acid sequence set forth in SEQ 
ID NO:2 or an allelic variant thereof, as well as the variants as described further below. 

10 [0017] "HC activity" or "HC polypeptide activity" or "biological activity of HC" or "biological activity of HC polypeptide" 
refers to the metabolic or physiologic function of said HC including similar activities or improved activities, or these 
activities with decreased undesirable side-effects. Also included are antigenic and immunogenic activities of said HC. 
[0018] "HC gene" refers to a polynucleotide having the nucleotide sequence set forth in SEQ ID NO:1 or allelic 
variants thereof and/or their complements as described in the following specification. 

15 [0019] "Antibodies" as used herein includes polyclonal and monoclonal antibodies, chimeric, single chain, and hu- 
manized antibodies (an antibody - mouse or any other-with some specific parts changed to human homologous re- 
gions), as well as Fab fragments, including the products of a Fab or other immunoglobulin expression library. 
[0020] "Isolated" means altered "by the hand of man" from the natural state. If an "isolated" composition or substance 
occurs in nature, it has been changed or removed from its original environment, or both. For example, a polynucleotide 

20 or a polypeptide naturally present in a living animal is not "isolated," but the same polynucleotide or polypeptide sep- 
arated from the coexisting materials of its natural state is "isolated", as the term is employed herein. For example, 
recombinant DNA molecules contained in a vector are considered isolated for the purposes of the present invention. 
Further examples of isolated nucleic acid molecules include recombinant DNA molecules maintained in heterologous 
host cells or purified DNA molecules in solution. Isolated RNA molecules include in vivo or in vitro RNA transcripts of 

25 the DNA molecules of the present invention. Isolated nucleic acid molecules according to the invention further include 
such molecules produced synthetically. 

[0021] "Polynucleotide" generally refers to any polyribonucleotide or polydeoxyribonucleotide, which may be unmodi- 
fied RNA or DNA or modified RNA or DNA. "Polynucleotides" include, without limitation, single- and double-stranded 
DNA, DNA that is a mixture of single- and double-stranded regions, single- and double-stranded RNA, and RNA that 

30 is mixture of single- and double-stranded regions, hybrid molecules comprising DNA and RNA that may be single- 
stranded or, more typically, double-stranded or a mixture of single- and double-stranded regions. In addition, "polynu- 
cleotide" refers to triple-stranded regions comprising RNA or DNA or both RNA and DNA. The term polynucleotide 
also includes DNAs or RNAs containing one or more modified bases and DNAs or RNAs with backbones modified for 
stability or for other reasons. "Modified" bases include, for example, tritylated bases and unusual bases such as inosine. 

35 a variety of modifications have been made to DNA and RNA; thus, "polynucleotide" embraces chemically, enzymatically 
or metabolically modified forms of polynucleotides as typically found in nature, as well as the chemical forms of DNA 
and RNA characteristic of viruses and cells. Thus, "polynucleotide" also encompasses variants of the nucleic acid 
molecules of the invention which encode portions, analogs or derivatives of HC. Variants may occur naturally, such as 
natural allelic variants. By an "allelic variant" is intended one of several alternate forms of a gene occupying a given 

*o locus on a chromosome of an organism. "Polynucleotide" also embraces relatively short polynucleotides, often referred 
to as oligonucleotides. 

[0022] "Polypeptide" refers to any peptide or protein comprising two or more amino acids joined to each other by 
peptide bonds or modified peptide bonds, i.e., peptide isosteres. "Polypeptide" refers to both short chains, commonly 
referred to as peptides, oligopeptides or oligomers, and to longer chains, generally referred to as proteins. Polypeptides 

45 may contain amino acids other than the 20 gene-encoded amino acids. "Polypeptides" include amino acid sequences 
modified either by natural processes, such as post-translational processing, or by chemical modification techniques 
which are well known in the art. Such modifications are well described in basic texts and in more detailed monographs, 
as well as in a voluminous research literature. Modifications can occur anywhere in a polypeptide, including the peptide 
backbone, the amino acid side-chains and the amino or carboxyl termini. It will be appreciated that the same type of 

so modification may be present in the same or varying degrees at several sites in a given polypeptide. Also, a given 
polypeptide may contain many types of modifications. Polypeptides may be branched as a result of ubiquitination, and 
they may be cyclic, with or without branching. Cyclic, branched and branched cyclic polypeptides may result from post- 
translation natural processes or may be made by synthetic methods. Modifications include acetylation, acylation, ADP- 
ribosylation, amidation, covalent attachment of flavin, covalent attachment of a heme moiety, covalent attachment of 

55 a nucleotide or nucleotide derivative, covalent attachment of a lipid or lipid derivative, covalent attachment of phos- 
photidylinositol, cross-linking, cyclization, disulfide bond formation, demethylation, formation of covalent cross-links, 
formation of cystine, formation of pyroglutamate, formylation, gamma-carboxylation, glycosylation, GPI anchor forma- 
tion, hydroxylation, iodination, methylation, myristoylation, oxidation, proteolytic processing, phosphorylation, prenyla- 
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tion, racemization, selenoylation, sulfation, transfer-RNA mediated addition of amino acids to proteins such as argi- 
nylation, and ubiquitination. See, for instance, Proteins - Structure and Molecular Properties, 2nd Ed., T. E. Creighton, 
W. H. Freeman and Company, New York, 1993 and Wold, R, Post-translational Protein Modifications: Perspectives 
and Prospects, p. 1-12 in Post-translational Covalent Modification of Proteins, B. C. Johnson, Ed., Academic Press, 
5 New York, 1983; Seifter et ai, "Analysis for protein modifications and non-protein cofactors", Meth. Enzymol. (1990) 
1 82:626-646 and Rattan ef ai, "Protein Synthesis: Post-translational Modifications and Aging", Ann NY Acad Sci (1 992) 
663:48-62. 

[0023] "Variant" as the term is used herein, represents a polynucleotide or polypeptide that differs from a reference 
polynucleotide or polypeptide respectively, but retains essential properties. A typical variant of a polynucleotide differs 

10 jn nucleotide sequence from another, reference polynucleotide. Changes in the nucleotide sequence of the variant 
may or may not alter the amino acid sequence of a polypeptide encoded by the reference polynucleotide. Nucleotide 
changes may result in amino acid substitutions, additions, deletions, fusions and truncations in the polypeptide encoded 
by the reference sequence, as discussed below. Generally, differences are limited so that the sequences of the refer- 
ence polypeptide and the variant are closely similar overall and, in many regions, identical. A variant and reference 

*5 polypeptide may differ in amino acid sequence by one or more substitutions, additions, deletions in any combination. 
A substituted or inserted amino acid residue may or may not be one encoded by the genetic code. A variant of a 
polynucleotide or polypeptide may be a naturally occurring such as an allelic variant, or it may be a variant that is not 
known to occur naturally. Non-naturally occurring variants of polynucleotides and polypeptides may be made by mu- 
tagenesis techniques or by direct synthesis. 

20 [0024] "Identity" is a measure of the identity of nucleotide sequences or amino acid sequences. In general, the se- 
quences are aligned so that the highest order match is obtained. "Identity" per se has an art-recognized meaning and 
can be calculated using published techniques. See, e.g.: Computational Molecular Biology, Lesk, A. M., ed., Oxford 
University Press, New York, 1988; Biocomputing : Informatics and Genome Projects, Smith, D. W., ed., Academic 
Press, New York, 1993; Computer Analysis of Sequence Data, Part I, Griffin, A. M., and Griffin, H. G., eds., Humana 

25 Press, New Jersey, 1994; Sequence Analysis in Molecular Biology, von Heinje, G., Academic Press, 1987; and Se- 
quence Analysis Primer, Gribskov, M. and Devereux, J., eds., M Stockton Press, New York, 1991. While there exists 
a number of methods to measure identity between two polynucleotide or polypeptide sequences, the term "identity" is 
well known to skilled artisans (Carillo, H., and Lipton, D., SIAM J Applied Math (1988) 48:1073). Methods commonly 
employed to determine identity or similarity between two sequences include, but are not limited to, those disclosed in 

30 Guide to Huge Computers, Martin J. Bishop, ed., Academic Press, San Diego, 1994, and Carillo, H., and Lipton, D., 
SIAM J Applied Math (1988) 48:1073. Methods to determine identity and similarity are codified in computer programs. 
Preferred computer program methods to determine identity and similarity between two sequences include, but are not 
limited to, GCG program package (Devereux, J., ef al , Nucleic Acids Research (1984) 12(1 ):387), BLASTP, BLASTN, 
FASTA (Atschul, S.F. et ai, J. Molec. Biol. (1990) 215:403) ... . 

35 [0025] . As an illustration, by a polynucleotide having a nucleotide sequence having at least, for example, 95% "iden- 
tity" to a reference nucleotide sequence is intended, a polynucleotide in which the nucleotide sequence is identical to 
the reference sequence except that the polynucleotide sequence may include up to five point mutations per each 100 
nucleotides of the reference nucleotide sequence. In other words, to obtain a polynucleotide having a nucleotide se- 
quence at least 95% identical to a reference nucleotide sequence, up to 5% of the nucleotides in the reference sequence 

40 may be deleted or substituted with another nucleotide, or a number of nucleotides up to 5% of the total nucleotides in 
the reference sequence may be inserted into the reference sequence. These mutations of the reference sequence 
may occur at the 5 or 3 terminal positions of the reference nucleotide sequence or anywhere between those terminal 
positions, interspersed either individually among nucleotides in the reference sequence or in one or more contiguous 
groups within the reference sequence. 

45 [0026] Similarly, by a polypeptide having an amino acid sequence having at least, for example, 95% "identity" to a 
reference amino acid sequence is intended a polypeptide having an amino acid sequence identical to the reference 
sequence except that the polypeptide sequence may include up to five amino acid alterations per each 100 amino 
acids of the reference amino acid sequence. In other words, to obtain a polypeptide having an amino acid sequence 
at least 95% identical to a reference amino acid sequence, up to 5% of the amino acid residues in the reference 

50 sequence may be deleted or substituted with another amino acid, or a number of amino acids up to 5% of the total 
amino acid residues in the reference sequence may be inserted into the reference sequence. These alterations of the 
reference sequence may occur at the amino or carboxy terminal positions of the reference amino acid sequence or 
anywhere between those terminal positions, interspersed either individually among residues in the reference sequence 
or in one or more contiguous groups within the reference sequence. 

55 [0027] In one aspect, the present invention relates to HC polypeptides. The HC polypeptides of the invention include 
the polypeptide of SEQ ID NO:2; as well as polypeptides comprising the amino acid sequence of SEQ ID NO: 2; and 
polypeptides comprising the amino acid sequences which have at least 80% identity to that of SEQ ID NO:2 over its 
entire length, and still more preferably at least 90% identity, and even still more preferably at least 95% identity to SEQ 
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ID NO: 2. Furthermore, those with at least 97-99% identity are highly preferred. Also included within HC polypeptides 
are polypeptides having the amino acid sequences which have at least 80% identity to the polypeptide having the 
amino acid sequence of SEQ ID NO:2 over its entire length, and still more preferably at least 90% identity, and still 
more preferably at least 95% identity to SEQ ID NO:2. Furthermore, those with at least 97-99% identity are highly 
5 preferred. Preferably HC polypeptides exhibit at least one biological activity of HC. It will be recognized that some 
amino acids of the HC polypeptides can be varied without significant effect on the structure or function of the protein. 
If such differences are contemplated, it should be remembered that there will be critical areas on the protein which 
determine activity. 

[0028] Thus, the invention further provides variants of the HC polypeptides which show substantial HC polypeptide 
10 activity or which include regions of HC polypeptide such as the protein portions discussed below. Such mutants include 
HC polypeptides in which deletions, insertions, type substitutions, inversions and/or repeats have occurred. Guidance 
concerning amino acid changes which are likely to be phenotypically silent can be found in Bowie, J. U. et al, "Deci- 
phering the Message in Protein Sequences : Tolerance to Amino Acid Substitutions", Science 247 : 1 306-1 310(1 990). 
[0029] Thus, the fragment, derivative or analog of the polypeptide of SEQ ID NO:2, or that encoded by the deposited 
15 cDNA as discussed below, may be : 

one in which one or more of the amino acid residues are substituted with a conserved or non-conserved amino 
acid residue, preferably a conserved amino acid residue, and such substituted amino acid residue may or may not 
be one encoded by the genetic code, or 
20 - one in which one or more of the amino acid residues includes a substituent group, or 

one in which the mature polypeptide is fused with another compound, such as a compound for increasing the half- 
life of the polypeptide (for example, polyethylene glycol), or 

one in which additional amino acids are fused to the mature polypeptide, such as the IgG Fc fusion region peptide 
or leader or secretory sequence which is employed for purification of the mature polypeptide or a pro-protein 
25 sequence. Such fragments, derivatives and analogs are deemed to be within the scope of the invention. 

[0030] Of particular interest are substitutions of charged amino acids with another charged amino acid and with 
neutral or negatively charged amino acids. The latter results in proteins with reduced positive charge to improve the 
characteristics of the HC polypeptide, for example for prevention of aggregation. k 
30 [0031] As indicated, changes are preferably of a minor nature, such as conservative amino acid substitutions that 
do not significantly affect the activity of the polypeptide. 

[0032] Examples of amino acids which can be used for substitution are given hereafter 
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[0033] Generally, the number of amino acid substitutions will not be more than 50. 

[0034] Amino acids in the HC polypeptide of the present invention that are essential for function can be identified by 
methods known in the art, such as site-directed mutagenesis or alanine-scanning mutagenesis (Cunnigham and Wells, 
Science 244 : 1081-1085 (1989)). The latter procedure introduces single alanine mutations at every residue in the 
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molecule. The resulting mutant molecules are then tested for biological activity such as receptor binding or in vitro 
proliferative activity. Sites that are critical for ligand-receptor binding can also be determined by structural analysis, e. 
g. crystallization, nuclear magnetic resonance or photoaffinity labeling (Smith et al. t J. Mol. Bioi. 224 : 899-904 (1992); 
de Vos et al., Science 255 : 306-312 (1992)). 

5 [0035] As to the selection of peptides or polypeptides bearing an antigenic epitope, it is well known that relatively 
short synthetic sequences that mimic part of a protein sequence are routinely capable of eliciting an antiserum that 
reacts with the partially mimicked protein. See for instance Sutcliffe, J. G„ Shinnick, T. M., Green, N. and Learner, R. 
A., Antibodies that React with Predetermined Sites on Proteins, Science 219 : 660-666 (1983). Peptides capable of 
eliciting protein-reactive sera are frequently represented in the primary sequence of a protein, can be characterized 

10 by a set of simple chemical rules, and are confined neither to immunodominant regions of intact proteins (i.e. immu- 
nogenic epitopes) nor to the amino or carboxyl terminals. 

[0036] Antigenic epitope-bearing peptides and polypeptides of the invention are therefore useful to raise antibodies, 
including monoclonal antibodies, that bind specifically to a polypeptide of the invention. Antigenic epitope-bearing 
peptides and polypeptides of the invention preferably contain a sequence of at least seven, more preferably at least 
15 nine and most preferably between about 15 to about 50 amino acids contained within the amino acid sequence of a 
polypeptide according to the invention. 

[0037] For example, antigenic polypeptides or peptides that can be used to generate specific HC antibodies 
comprise : a polypeptide comprising amino acid residues from about 255 to about 272 of SEQ ID NO:2, a polypeptide 
comprising amino acid residues from about 475 to about 490 of SEQ ID NO:2. As indicated above, it is believed that 
20 the above polypeptide fragments are antigenic regions of the HC polypeptide as determined using Protean (version 
3.16) from DNASTAR. In particular, polypeptides comprising amino acid residues from 255 to 272 and amino acid 
residues from 475 to 490 of SEQ ID NO:2 were successfully used to raise polyclonal antibodies in rabbits, which 
antibodies are able to recognize HC in homogenates of human brain samples. 

[0038] The epitope-bearing peptides and polypeptides of the invention can be prepared by any conventional means, 

25 for instance by the method described in document US Patent N° 4,631 ,211 . 

[0039] HC polypeptides of the present invention and the epitope-bearing fragments thereof can be combined with 
parts of the constant domain of immunoglobulins (IgG), resulting in chimeric polypeptides. These fusion proteins facil- 
itate the purification and generally show an increased half life in vivo. See for instance Traunecker et al., Nature 331 : 
84-86 (1988). Fusion proteins that have a disulfide-linked dimeric structure due to the IgG part can also be more efficient 

30 in binding and neutralizing other molecules than the monomeric HC protein or protein fragment thereof (Fountoulakis 
et al., J. Biochem. 270 : 3958-3964 (1995)). 

[0040] The HC polypeptides may be in the form of the "mature" protein or may be a part of a larger protein such as 
~a fusion protein. It is often advantageous to include an additional amino acid sequence which contains secretory or 
leader sequences, pro-sequences, sequences which aid in purification such. as multiple histidine residues, or an ad- 

35 ditional sequence for stability during recombinant production. 

[0041] Fragments of the HC polypeptides or variants, mutated forms, derivatives thereof as here above mentioned 
are also included in the invention. By a fragment is meant a polypeptide having an amino acid sequence that entirely 
is the same as part, but not all, of the amino acid sequence of the aforementioned HC polypeptides. As with HC 
polypeptides, fragments may be "free-standing," or comprised within a larger polypeptide of which they form a part or 

40 region, most preferably as a single continuous region. Representative examples of polypeptide fragments of the in- 
vention, include, for example, fragments from about amino acid number 1-20, 21-40, 41-60, 61-80, 81-100, and 101 
to the end of HC polypeptide shown in SEQ ID NO:1. In this context "about" includes the particularly recited ranges 
larger or smaller by several, for example 5, 4, 3, 2 or 1 amino acid at either extreme or at both extremes. 
[0042] Preferred fragments include, for example, truncation polypeptides having the amino acid sequence of HC 

^5 polypeptides, except for deletion of a continuous series of residues that includes the amino terminus, or a continuous 
series of residues that includes the carboxyl terminus or deletion of two continuous series of residues, one including 
the amino terminus and one including the carboxyl terminus. Also preferred are fragments characterized by structural 
or functional attributes such as fragments that comprise alpha-helix and alpha-helix forming regions, beta-sheet and 
beta-sheet-forming regions, turn and turn-forming regions, coil and coil-forming regions, hydrophilic regions, hydro- 

50 phobic regions, alpha amphipathic regions, beta amphipathic regions, flexible regions, surface-forming regions, sub- 
strate binding region, and high antigenic index regions. Other preferred fragments are biologically active fragments. 
Biologically active fragments are those that mediate HC activity, including those with a similar activity or an improved 
activity, or with a decreased undesirable activity. Also included are those that are antigenic or immunogenic in an 
animal, especially in a human. 

55 [0043] Preferably, all of these polypeptide fragments retain the biological activity of HC, including antigenic activity. 
[0044] The HC polypeptides of the invention can be prepared in any suitable manner. Such polypeptides include 
isolated naturally occurring polypeptides, recombinantly produced polypeptides, or polypeptides produced by a com- 
bination of these methods. Means for preparing such polypeptides are well understood in the art. 
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[0045] In another embodiment, the HC polypeptides of the invention can be prepared by chemical synthesis, for 
example by using non natural and/or modified amino acids. Thus, it can be advantageous to use non natural amino 
acids, for example D-amino acids, to enhance life time of the polypeptides, or analogs of amino acids, for example 
sulfurized forms of amino acids. 

5 [0046] Methods for preparing such polypeptides are known in the art. 

[0047] For example, chemical synthesis of the polypeptides of the invention can be carried out by using a solid phase 
process. In such a method, the alpha-amino moiety of an amino acid is protected by using conventional protective 
groups (e.g. BOC or FMOC) and the thus protected amino acid is fixed onto an inert solid support by its C-terminal 
region. At the end of this step, the protective group is removed and a second amino acid, optionally protected in a 

10 similar manner, is fixed. The N-terminal protective groups are removed after each amino acid is fixed, protective groups 
possibly present on the lateral chains being conserved. When the peptide chain is synthesized, the peptide is cleaved 
from the support and the remaining protective groups are removed. Coupling can be carried out by using N.N'-diiso- 
propyl-carbodiimine (DIC), among others, in a solvent of DMF or DCM. This solid phase synthesis is described for 
example in Stewart J. M. et al„ Solid Phase Peptides Synthesis. Pierce Chem. Company, Rockford, 111, 2 nd Ed., (1984). 

is [0048] Other polypeptides according to the invention comprise the amino acid sequence of the HC polypeptide which 
possesses the complete amino acid sequence encoded by the cDNA clone contained in ATCC deposit number PTA- 
541 as discussed below, as well as polypeptides which comprise the amino acid sequence of the mature HC polypeptide 
which possesses the complete amino acid sequence encoded by the cDNA clone contained in ATCC deposit number 
PTA-541. 

20 [0049] In another aspect, the invention provides a peptide or polypeptide comprising an epitope-bearing portion of 
a polypeptide of the invention. The epitope of this polypeptide portion is an immunogenic or antigenic epitope of a 
polypeptide described in the present invention. An immunogenic epitope is defined as a part of a protein that elicits an 
antibody response when the whole protein is the immunogen. On the other hand, a region of a protein molecule to 
which an antibody can bind is defined as an antigenic epitope. 

25 [0050] Another aspect of the invention relates to HC polynucleotides. 

[0051] Unless otherwise indicated, all nucleotide sequences determined by sequencing a DNA molecule herein have 
been determined by using an automated DNA sequencer, and all amino acid sequences of polypeptides encoded by 
DNA sequences determined herein have been predicted by translation of a DNA sequence determined as above. 
Therefore, as is known in the art for any DNA sequence determined by this approach, any nucleotide sequence may 

30 contain some errors. Nucleotide sequences determined by automation are typically at least about 95 % identical to 
about 99.9 % identical to the actual nucleotide sequence of the sequenced DNA molecule. 

[0052] HC polynucleotides include isolated polynucleotides which encode the HC polypeptides and fragments, and 
polynucleotides closely related thereto. More specifically, HC polynucleotides of the invention include a polynucleotide 

— comprising the nucleotide sequence set forth in SEQJD NO:1 encoding a HC polypeptide of SEQ ID NO: 2, and a 
35 polynucleotide having the particular sequence of SEQ ID NO:1 . HC polynucleotides of the invention further include a 

- . .. polynucleotide comprising a nucleotide sequence that has at least 80% identity to a nucleotide sequence:encoding 

the HC polypeptide of SEQ ID NO:2 over its entire length, and a polynucleotide that is at least 80% identical to that 
having SEQ ID NO:1 over its entire length. In this regard, polynucleotides at least 90% identical are particularly pre- 
ferred, and those with at least 95% identity are especially preferred. Furthermore, those with at least 97% identity are 

40 highly preferred and those with at least 98-99% identity are most highly preferred, with at least 99% identity being the 
most preferred. Also included under HC polynucleotide is a nucleotide sequence which has sufficient identity to the 
nucleotide sequence contained in SEQ ID NO:1 or contained in the cDNA insert of the plasmid deposited with the 
ATCC Deposit number PTA-541 discussed below to hybridize under conditions useable for amplification or for use as 
a probe or marker. Moreover, HC polynucleotides include a nucleotide sequence having at least 80% identity to a 

45 nucleotide sequence encoding the HC polypeptide, either in a mature form or not, expressed by the cDNA insert 
deposited at the ATCC with Deposit Number PTA-541 as discussed below, and a nucleotide sequence comprising at 
least 15 contiguous nucleotides of such cDNA insert. In this regard, polynucleotides at least 90% identical are partic- 
ularly preferred, and those with at least 95% identity are especially preferred. Furthermore, those with at least 97% 
identity are highly preferred and those with at least 98-99% identity are most highly preferred, those with at least 99% 

so identity being the most preferred. The invention also provides polynucleotides which are complementary to all HC 
polynucleotides according to the invention. 

[0053] A deposit containing a human HC cDNA has been deposited with the American Type Culture Collection 
(ATCC), 10801 University Blvd, Manassas, VA 20110-2209, USA, on August 17, 1999, and assigned ATCC Deposit 
Number PTA-541 . The deposited material (clone) is pCDNA3 that further contains the full length cDNA of HC, referred 
55 to as Human plasmid SGC-1 1 5 upon deposit. The nucleotide sequence of the polynucleotide contained in the deposited 
material, as well as the amino acid sequence of the polypeptide encoded thereby, are controlling in the event of any 
conflict with any description of sequences herein. 

[0054] The deposit has been made under the terms of the Budapest Treaty on the international recognition of the 
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deposit of micro-organisms for purposes of patent procedure. 

[0055] HC of the invention is structurally related to other proteins of the Novel protease-like protein family, as shown 
by the results of sequencing the cDNA of SEQ ID NO:1 encoding HC. The cDNA sequence set forth in SEQ ID NO:1 
contains an open reading frame (nucleotide, numbers 1 to 1524) encoding a polypeptide of about 508 amino acids 

s residues of SEQ ID NO:2, with a predicted leader sequence of about 28 amino acid residues and a deduced molecular 
weight of about 58.6 KDa. Amino acid sequence set forth in SEQ ID NO:2 has about 45 % identity (using Megalign 
with the Clustal algorithm from DNASTAR) in 470 amino acid residues with a yeast hypothetical 52.9 KD protein. 
Nucleotide sequence set forth in SEQ ID NO:1 has about 59.6% identity (Megalign with the Clustal algorithm from 
DNASTAR) in 1256 nucleotide residues with the yeast hypothetical protein 52.9 KD (CAB07646). 

10 [0056] The polynucleotides of the present invention encoding HC may be obtained using standard cloning and 
screening, from a cDNA library derived from mRNA in cells of human^brain using the expressed sequence tag (EST) 
analysis (Adams, M. D., et al. Science (1991) 252:1651-1656; Adams, M.. etai, Nature, (1992) 355:632-634; Adams, 
M. D., et a/., Nature (1995) 377 Supp:3-174). Polynucleotides of the invention can also be obtained from natural sources 
such as genomic DNA libraries or can be synthesized using well known and commercially available techniques. 

is [0057] The nucleotide sequence encoding HC polypeptide of SEQ ID NO:2 may be identical to the polypeptide en- 
coding sequence contained in SEQ ID NO:1, or it may be a different sequence, which as a result of the redundancy 
(degeneracy) of the genetic code, also encodes the polypeptide of SEQ ID NO:2. 

[0058] As indicated, the invention also provides the mature form of the HC polypeptide of the present invention. 
According to the signal hypothesis, proteins secreted by mammalian ceils have a signal or secretory leader sequence 

20 which is cleaved from the mature protein once export of the growing protein chain across the rough endoplasmic 
reticulum has been initiated. Most mammalian cells and even insect cells cleave secreted proteins with the same 
specificity. However, in some cases, cleavage of a secreted protein is not entirely uniform, which results in two or more 
mature species on the protein. Further, it has long been known that the cleavage specificity of a secreted protein is 
ultimately determined by the primary structure of the complete protein, i.e. its amino acid sequence. Therefore, the 

25 invention provides a nucleotide sequence encoding the mature HC polypeptide having the amino acid sequence en- 
coded by the cDNA clone contained in the host identified as ATCC Deposit PTA-541. By the mature HC polypeptide 
having the amino acid sequence encoded by the cDNA clone contained in the host identified as ATCC Deposit PTA- 
541 is meant the mature form of HC polypeptide produced by expression in a mammalian cell of the complete open 
. . . reading frame encoded by the human DNA sequence of the clone contained in the vector in the deposited host. As . 

30 indicated, the mature HC having the amino acid sequence encoded by the cDNA clone contained in ATCC Deposit 
PTA-541 may or may not differ from the predicted "mature" HC protein represented in SEQ ID NO:2, depending on 
the accuracy of the predicted cleavage site based on computer analysis. 

- - —[0059] Methods for predicting whether a protein has a secretory leader as well as the cleavage site of this leader 
_ : * - sequence are available. For.instance,.the methods of Mc Geoch (Virus Res. 3:271 r286 (.1985)) and von Heinje (Tvuc/e/c./ 

.35 Acids Res. 14:4683-4690 (1986)) can be used. 

: [0060]~ .When the polynucleotides of the invention are used for. the recombinant production of HC polypeptide, the 
polynucleotide may include the coding sequence for the mature polypeptide or a fragment thereof, by itself; the coding 
sequence forthe mature polypeptide or fragment in reading frame with other coding sequences, such as those encoding 
a leader or secretory sequence, a pre-, or pro- or prepro-protein sequence, or other fusion peptide portions. For ex- 

40 ample, a marker sequence which facilitates purification of the fused polypeptide can be encoded. In certain preferred 
embodiments of this aspect of the invention, the marker sequence is a hexa-histidine peptide, as provided in the pQE 
vector (Qiagen, Inc.) and described in Gentz et a/., Proc. Natl. Acad. Sci. USA (1989) 86:821-824, or is an HA or V5 
epitope tag. The polynucleotide may also contain non-coding 5' and 3* sequences, such as transcribed, non-translated 
sequences, splicing and polyadenylation signals, ribosome binding sites and sequences that stabilize mRNA. 

45 [0061] Further preferred embodiments are polynucleotides encoding HC variants comprising the amino acid se- 
quence HC polypeptide of SEQ ID NO:2 in which several, for example 5-10, 1-5, 1-3, 1-2 or 1 amino acid residues are 
substituted, deleted or added, in any combination. 

[0062] The invention is further directed to fragments of the isolated nucleic acid molecules described herein. By a 
fragment of an isolated nucleic acid molecule having the nucleotide sequence of the deposited cDNA or the nucleotide 

so molecule having the nucleotide sequence shown in SEQ ID NO:1 is intended fragments of at least about 15 nucleotides, 
and preferably of at least about 20 nucleotides, more preferably of at least about 30 nucleotides, even more preferably 
of at least about 40 nucleotides, which are useful as diagnostic probes and primers. By a fragment of at least 20 
nucleotides in length, for example, are intended a fragment which includes 20 or more contiguous bases from the 
nucleotide sequence of the deposited cDNA or from the sequence shown in SEQ ID NO:1. 

55 [0063] In a first aspect, preferred nucleic acid fragments according to the invention comprise those which can be 
used as nucleic probes. 

[0064] For example, a nucleic probe according to the invention comprises the nucleotide sequence located between 
nucleotide number 1 and nucleotide number 500 of the sequence set forth in SEQ ID NO:1. 
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[0065] Another nucleic probe according to the invention comprises the nucleotide sequence located between nucle- 
otide number 501 and nucleotide number 1524 of the sequence set forth in SEQ ID NO:1. 

[0066] In a second aspect, preferred nucleic acid fragments according to the invention comprise a nucleotide se- 
quence encoding for an antigenic region of HC. 
5 [0067] For example, preferred nucleic acid fragments according to the invention comprise nucleic acid molecules 
which encode a polypeptide comprising amino acid residues from about 256 to about 272 of the amino acid sequence 
shown in SEQ ID NO:2, nucleic acid molecules which encode a polypeptide comprising amino acid residues from about 
475 to about 490 of the amino acid sequence of SEQ ID NO:2. 

[0068] The present invention further relates to polynucleotides that hybridize to the herein above-described sequenc- 
10 es. In this regard, the present invention especially relates to polynucleotides which hybridize under stringent conditions 
to the herein above-described polynucleotides. As herein used, the term "stringent conditions" means hybridization 
will occur only if there is at least 95% and preferably at least 97% identity between the sequences. 
[0069] Polynucleotides of the invention, which are identical or sufficiently identical to a nucleotide sequence contained 
in SEQ ID NO:1 or a fragment thereof, may be used as hybridization probes for cDNA and genomic DNA, to isolate 
15 full-length cDNAs and genomic clones encoding HC polypeptide and to isolate cDNA and genomic clones of other 
genes that have a high sequence similarity to the HC gene. Such hybridization techniques are known to those of skill 
in the art. Typically these nucleotide sequences are 80% identical, preferably 90% identical, more preferably 95% 
identical to that of the referent. The probes generally will comprise at least 15 nucleotides. Preferably, such probes will 
have at least 30 nucleotides and may have at least 50 nucleotides. Particularly preferred probes will range between 
20 30 and 50 nucleotides. 

[0070] In one embodiment, to obtain a polynucleotide encoding HC polypeptide comprises the steps of screening 
an appropriate library under stringent hybridization conditions with a labeled probe having the SEQ ID NO: 1 or a 
fragment thereof; and isolating full-length cDNA and genomic clones containing said polynucleotide sequence. Such 
hybridization techniques are well known to those of skill in the art. Stringent hybridization conditions are as defined 
25 above or alternatively conditions under overnight incubation at 42° C in a solution comprising : 50% formamide, 5xSSC 
(150 mM NaCI, 15 mM trisodium citrate), 50 mM sodium phosphate (pH 7.6), 5x Denhardt's solution, 10 % dextran 
sulfate, and 20 microgram/ml denatured, sheared salmon sperm DNA, followed by washing the filters in O.lxSSC at 
about 65°C. 

[0071] The polynucleotides and polypeptides of the present invention may be employed as research reagents and 

30 materials for discovery of treatments and diagnostics of animal and human diseases. 

[0072] The present invention also relates to recombinant vectors which comprise a polynucleotide or polynucleotides 
according to the invention and expression system capable of producing HC polypeptide when said expression system 
is present in a compatible host cell; host cells which are genetically engineered with- vectors of the invention. The 
invention also relates to the production of polypeptides of the invention by recombinant techniques. Cell-free translation 

35 systems can also be employed to produce such proteins using RNAs derived from the DNA constructs of the present 
invention. ■ .r. 

[0073] For recombinant production, host cells can be genetically engineered to incorporate expression systems or 
portions thereof for polynucleotides of the present invention. Introduction of polynucleotides into host cells can be 
effected by methods described in many standard laboratory manuals, such as Davis et al., Basic Methods in Molecular 
40 Biology (1 986) and Sambrook et al., Molecular Cloning: a Laboratory Manual, 2nd Ed., Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, N.Y. (1989) such as calcium phosphate transfection, DEAE-dextran mediated transfection, 
transvection, microinjection, cationic lipid-mediated transfection, electroporation, transduction, scrape loading, ballistic 
introduction or infection. 

[0074] The expression vectors will preferably include at least one selectable marker. Such markers include dihydro- 
45 folate reductase or neomycin resistance for eukaryotic cell culture and tetracycline or ampicillin resistance genes for 
culturing E. colt and other bacteria. 

[0075] The DNA insert should preferably be linked to an appropriate promoter, such as the bacterial phage lambda 
PL and RP, T3 and T7, E. coli lac, trp and tac promoters, the eukaryotic SV40 early and late promoters, CM V immediate 
early promoters, eukaryotic promoters from retroviral LTRs, such as those of the Rous Sarcoma Virus (RSV) to name 

50 a few. Other suitable promoters are available for the skilled artisan. 

[0076] The expression constructs will further contain sites for transcription initiation, termination and, in the tran- 
scribed region, a ribosome binding site for translation. The coding region of the mature transcripts expressed by the 
constructs will preferably include a translation initiating at the beginning and a termination codon (UAA, AGA or UAG) 
appropriately positioned at the end of the polypeptide to be translated. 

55 [0077] Representative examples of appropriate hosts include bacterial cells, such as streptococci, staphylococci, E. 
coli, Streptomyces, Salmonella typhimurium and Bacillus subtilis cells; fungal cells, such as yeast cells and Aspergillus 
cells; insect cells such as Drosophila S2 and Spodoptera Sf9 cells; animal cells such as CHO, COS, HeLa, C127, 3T3, 
BHK, HEK 293 and Bowes melanoma cells; and plant cells. 
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[0078] A great variety of expression systems can be used. Such systems include, among others, chromosomal, 
episomal and virus-derived systems, e.g., vectors derived from bacterial plasmids, from bacteriophage, from trans- 
posons, from yeast episomes, from insertion elements, from yeast chromosomal elements, from viruses such as bac- 
uloviruses, papova viruses, such as SV40, vaccinia viruses, adenoviruses, fowl pox viruses, pseudorabies viruses and 
5 retroviruses, and vectors derived from combinations thereof, such as those derived from plasmid and bacteriophage 
genetic elements, such as cosmids and phagemids. Generally, any system or vector suitable to maintain, propagate 
or express polynucleotides, for producing a polypeptide in a host, may be used. 

[0079] Among vectors preferred for use in bacteria are pGE70, pGE60 and pG-9, available form Qiagen, Phagescript 
vectors, pBS vectors, Bluescript vectors, pNH8A, pNH16a, pNH18A, pNH46A, available from Stratagene, ptrc99a, 
10 pKK223-3, pDR540, pRIT5 available from Pharmacia. 

[0080] Among preferred eukaryotic vectors are pWLNEO, pSV2CAT, pOG44, pXT1 and pSG available from Strata- 
gene, pSVK3, pBPV, pMSG and pSVL available from Pharmacia, pcDNA3, pcDNA3.1 and pTracer™ available from 
Invitrogene. Other suitable vectors will be readily apparent to the skilled in the art. 

[0081] The appropriate nucleotide sequence may be inserted into an expression system by any of a variety of well- 
15 known and routine techniques, such as, for example, those set forth in Sambrook et a/., Molecular Cloning, a Laboratory 
Manual (supra). 

[0082] For secretion of the translated protein into the lumen of the endoplasmic reticulum, into the periplasmic space 
or into the extracellular environment, appropriate secretion signals may be incorporated into the desired polypeptide. 
These signals may be endogenous to the polypeptide or they may be heterologous signals. 

20 [0083] If the HC polypeptide is to be expressed for use in screening assays, generally, it is preferred that the polypep- 
tide be produced at the surface of the cell. In this event, the cells may be harvested prior to use in the screening assay. 
If HC polypeptide is secreted into the medium, the medium can be recovered in order to recover and purify the polypep- 
tide; if produced intracellular^, the cells must first be lysed before the polypeptide is recovered. 
[0084] HC polypeptides can be recovered and purified from recombinant cell cultures by well-known methods in- 

25 eluding ammonium sulfate or ethanol precipitation, acid extraction, anion or cation exchange chromatography, phos- 
phocellulose chromatography, hydrophobic interaction chromatography, affinity chromatography, hydroxylapatite chro- 
matography and lectin chromatography. Most preferably, high performance liquid chromatography is employed for 
purification. Well known techniques for refolding proteins may be employed to regenerate active conformation when 
- the polypeptide is denatured during isolation and or purification. : . 

30 [0085] The polypeptide may be expressed in a modified form, such as a fusion protein, and may include not only 
secretion signals, but also additional heterologous functional regions. For instance, a region of additional amino acids, 
particularly charged amino acids, may be added to the N-terminus of the polypeptide to improve stability and persistence 

■ in the host cell, during purification or during subsequent handling and storage. Also, peptide moieties may be added 
"... " to the polypeptide to facilitate purification. Such regions may.be removed prior to final preparation of the polypeptide. 

35 The addition of peptide moieties to polypeptides to engender secretion or excretion, to improve stability and to facilitate 
purification, among others, are familiar and routine techniques in the art. A preferred fusion protein comprises a het- 
erologous region from immunoglobulin that is useful to .solubilize proteins. For example, EP-A-0 464 533 discloses 
fusion proteins comprising various portions of constant region of immunoglobulin molecules together with another 
human protein or part thereof. In many cases, the Fc part in a fusion protein is advantageous for use in therapy and 

40 diagnosis. On the other hand, for some uses, it would be desirable to delete the Fc part after the fusion protein has 
been expressed, detected and purified. For instance, it is the case when Fc portion proves to be a hindrance to use 
in therapy and diagnosis, for example when the fusion protein is to be used as antigen for immunizations. 
[0086] It is believed that mammals with certain diseases or susceptibility to certain diseases express significantly 
altered, enhanced or diminished levels of HC and mRNA encoding HC, when compared to a corresponding "standard" 

45 mammal, i.e. a mammal of the same species not having or not presenting a susceptibility to the disease. Further, it is 
believed that significantly altered, enhanced or diminished levels of HC can be detected in certain body tissues (e.g. 
bone marrow, cerebellum, brain, midbrain, spinal cord, nerve endings, retina, breast, pituitary, heart, lung, skeletal 
muscle, kidney, pancreas, intestine, stomach) from mammals with the disease or presenting a susceptibility to the 
disease when compared to tissues from mammals of the same species not having or not presenting a susceptibility 

50 to the disease. Thus, the invention provides a diagnostic method which involves assaying the expression level of the 
gene encoding a HC polypeptide of the present invention in mammalian cells or body fluid and comparing the gene 
expression level with a standard HC expression level, whereby an altered, enhanced or diminished expression level 
of the gene with respect to the standard is indicative of the disease or of a susceptibility to the disease. Preferred 
mammals include monkeys, apes, cats, dogs, cows, pigs, horses, rabbits and humans. 

55 [0087] Diseases which can be diagnosed include but are not limited to Alzheimer's disease and cognitive disorders, 
developmental abnormalities and fetal deficiencies, neurodegenerative disorders such as amyotrophic lateral sclerosis, 
Parkinson's disease, schizophrenia, abnormal mental states, ischemic shock. 

[0088] By "assaying the expression level of the gene encoding a HC polypeptide" is intended qualitatively or quan- 
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titatively measuring or estimating the level of HC or the level of the mRNA encoding HC in a first biological sample 
either directly (e.g. by determining or estimating absolute protein level or mRNA level) or relatively (e.g. by comparing 
with the HC polypeptide level or mRNA level in a second biological sample). 

[0089] Preferably, the HC level or mRNA level in the first biological sample is measured or estimated and compared 
5 with a standard HC level or mRNA level, the standard being taken from a second biological sample obtained from an 
individual not having (nor presenting a susceptibility to) the disease. As will be appreciated, once a standard HC level 
or mRNA level is known, it can be used repeatedly as a standard for comparison. 

[0090] By "biological sample" is intended any biological sample, either tissue sample or body fluid, obtained from an 
individual, cell line, tissue culture, or other source which may contain HC polypeptide or mRNA. 

10 [0091] Total cellular RNA can be isolated from a biological sample using the single-step guanidinium-thiocyanate- 
phenol-chloroform method described in Chomczynski and Sacchi, Anal. Biochem. 162 : 156-159 (1987). Levels of 
mRNA encoding HC are then assayed using any appropriate method. These include Northern Blot analysis (Harada 
et al., Ce//63 : 303-312 (1990)), S1 nuclease mapping (Fujita et al., Ce//49 : 357-367 (1987)), the polymerase chain 
reaction (PCR), reverse transcription in combination with the polymerase chain reaction (RT-PCR) (Fujita et al, Cell 

15 49 : 357-367 (1987)), and reverse transcription in combination with the ligase chain reaction (RT-LCR). 

[0092] Assaying HC polypeptide levels in a biological sample can occur using antibody-based techniques. For ex- 
ample, HC expression in tissues can be studied with classical immunohistological methods (Jalkanen, M., et al., J. 
Cell. Biol. 101 : 976-985 (1985); Jalkanen, M., et al., J. Cell. Biol. 105: 3087-3096 (1987)). Other antibody-based 
methods useful for detecting HC gene expression include immunoassays, such as the enzyme linked immunosorbent 

20 assay (ELISA) and the radioimmunoassay (RIA). 

[0093] Suitable labels are known in the art and include enzyme labels, such as glucose oxidase, and radioisotopes, 
such as iodine ( 125 |, 121 1), carbon ( 14 C), sulfur ( 35 S), tritium ( 3 H), indium ( 112 |n), and technetium ( 99m Tc), and fluorescent 
labels, such as fluorescein, rhodamine and biotin. 

[0094] In addition, the invention provides a diagnostic assay for diagnosing or determining a susceptibility to, among 
25 others, Alzheimer's disease and cognitive disorders, developmental abnormalities and fetal deficiencies, neurodegen- 
erative disorders such as amyotrophic lateral sclerosis, Parkinson's disease, schizophrenia, abnormal mental states, 
ischemic shock through detection of mutation in the HC gene by the methods described below. 
[0095] Nucleic acids for diagnosis may be obtained from a subject's cells, such as from blood, urine, saliva, tissue 
biopsy or autopsy material. The genomic DNA may be used directly for detection or may be amplified enzymatically 
30 by using PCR or other amplification techniques prior to analysis. RNA or cDNA may also be used in similar fashion. 
Deletions and insertions can be detected by a change in size of the amplified product in comparison to the normal 
genotype. Point mutations can be identified by hybridizing amplified DNA to labeled HC nucleotide sequences. Perfectly 
■■■ matched sequences can be distinguished from mismatched duplexes by RNase digestion or by differences in melting 
. - temperatures.. DNA sequence differences may also be detected by alterations in.electrophoretiamobility of DNA frag- 
35 ments in gels, with or without denaturing agents, or by direct DNA sequencing. See, e.g., Myers et al., Science (1985) 
230:1242. Sequence changes at specific locations may also be revealed by nuclease protection assays, such as RNase 
and S1 protection or the chemical cleavage method. See Cotton et al., Proc Natl Acad Sci USA (A 985) 85: 4397-4401 . 
In another embodiment, an array of oligonucleotides probes comprising HC nucleotide sequence or fragments thereof 
can be constructed to conduct efficient screening of e.g., genetic mutations. Array tech no logy methods are well known 
40 and have general applicability and can be used to address a variety of questions in molecular genetics including gene 
expression, genetic linkage, and genetic variability. (See for example : M.Chee et al., Science, Vol 274, p 610-613 
(1996)). 

[0096] The nucleotide sequences of the present invention are also valuable for chromosome identification. The se- 
quence is specifically targeted to and can hybridize with a particular location on an individual human chromosome. 

45 The mapping of relevant sequences to chromosomes according to the present invention is an important first step in 
correlating those sequences with gene associated disease. Once a sequence has been mapped to a precise chromo- 
somal location, the physical position of the sequence on the chromosome can be correlated with genetic map data. 
Such data are found, for example, in V. McKusick, Mendelian Inheritance in Man (available on line through Johns 
Hopkins University Welch Medical Library). The relationship between genes and diseases that have been mapped to 

50 the same chromosomal region are then identified through linkage analysis (coinheritance of physically adjacent genes). 
[0097] The differences in the cDNA or genomic sequence between affected and unaffected individuals can also be 
determined. If a mutation is observed in some or all of the affected individuals but not in any normal individuals, then 
the mutation is likely to be the causative agent of the disease. 

[0098] The polypeptides of the invention or their fragments or analogs thereof, or cells expressing them can also be 
55 used as immunogens to produce antibodies immunospecific for the HC polypeptides. The term "immunospeciflc , • means 
that the antibodies have substantial greater affinity for the polypeptides of the invention than their affinity for other 
related polypeptides. 

[0099] Antibodies generated against HC polypeptides can be obtained by administering the polypeptides or epitope- 
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bearing fragments, analogs or cells to an animal, preferably a non human, using routine protocols. For preparation of 
monoclonal antibodies, any technique which provides antibodies produced by continuous cell line cultures can be used. 
Examples include the hybridoma technique (Kohler, G. and Milstein, C, Nature (1975) 256:495-497), the trioma tech- 
nique, the human B-cell hybridoma technique (Kozbor et a/., Immunology Today (1983) 4:72) and the EBV-hybridoma 
5 technique (Cole ef a/., Monoclonal Antibodies and Cancer Therapy, pp. 77-96, Alan R. Liss, Inc., 1985). 

[01 00] Techniques for the production of single chain antibodies (U.S. Patent No. 4,946,778) can also be adapted to 
produce single chain antibodies to polypeptides of the invention. Also, transgenic mice, or other organisms including 
other mammals, may be used to express humanized antibodies. 

[0101] The above-described antibodies may be employed to isolate or to identify clones expressing the polypeptide 

10 or to purify the polypeptides by affinity chromatography. 

[0102] Antibodies against HC polypeptides may also be employed to treat Alzheimer's disease and cognitive disor- 
ders, developmental abnormalities and fetal deficiencies, neurodegenerative disorders such as amyotrophic lateral 
sclerosis, Parkinson's disease, schizophrenia, abnormal mental states, ischemic shock., epilepsia, among others. 
[0103] Another aspect of the invention relates to a method for inducing an immunological response in a mammal 

15 which comprises inoculating the mammal with HC polypeptide, or a fragment thereof, adequate to produce antibody 
and/or T cell immune response to protect said animal from Alzheimer's disease and cognitive disorders, developmental 
abnormalities and fetal deficiencies, neurodegenerative disorders such as amyotrophic lateral sclerosis, Parkinson's 
disease, schizophrenia, abnormal mental states, ischemic shock, epilepsia, among others. Yet another aspect of the 
invention relates to a method of inducing an immunological response in a mammal which comprises, delivering a HC 

20 polypeptide via a vector directing expression of HC polynucleotide in vivo in order to induce such an immunological 
response to produce antibody to protect said animal from diseases. 

[01 04] A further aspect of the invention relates to an immunological/vaccine formulation (composition) which, when 
introduced into a mammalian host, induces an immunological response in that mammal to a HC polypeptide, wherein 
the composition comprises a HC polypeptide or HC gene. The vaccine formulation may further comprise a suitable 

25 carrier. Since HC polypeptide may be broken down in the stomach, it is preferably administered parenterally (including 
subcutaneous, intramuscular, intravenous, intradermal etc. injection). Formulations suitable for parenteral administra- 
tion include aqueous and non-aqueous sterile injection solutions which may contain anti-oxidants, buffers, bacteriostats 
and solutes which render the formulation isotonic with the blood of the recipient; and aqueous and non-aqueous sterile 
suspensions which may include suspending agents or thickening agents. The formulations may be presented in unit- 

30 dose or multi-dose containers, for example, sealed ampoules and vials and may be stored in a freeze-dried condition 
requiring only the addition of the sterile liquid carrier immediately prior to use. The vaccine formulation may also include 
adjuvant systems for enhancing the immunogenicity of the formulation, such as oil-in-water systems and other systems 

.- - . - - - k nown i n the art. The dosage will depend on the specific activity ofthe vaccine and can be readily determined by routine 

*:- experimentation. „ : - : _ \ _ .": 

35 . [0105] HC polypeptides of the present invention may be employed.in a screening process for candidate compounds 
. .... which, directly or indirectly, activate (agonists) or inhibit activation of (antagonists, or otherwise called inhibitors) HC 
polypeptide of the present invention. Thus, the polypeptides of the invention may also be used to identify agonists or 
r antagonists from, for example, cells, cell-free preparations, chemical libraries, and natural product mixtures. These 
agonists or antagonists may be natural substrates, ligands, receptors, etc, as the case may be, of the polypeptide of 

40 the present invention; or may be structural or functional mimetics of the polypeptide of the present invention. See 
Coligan et ai., Current Protocols in Immunology 1 (2): Chapter 5 (1 991 ). 

[0106] The choice of the candidate compounds may be based on similarity with known agonists or antagonists of 
carnosinase. In particular, known antagonists or agonists of the metalloprotease family may be used to determine 
candidate compounds to be screened. 
45 [0107] Such known compounds include, but are not limited to, bestatine, phosphoramidon, 1,10-phenanthroline, p- 
hydroxymercuribenzoic acid, thiorphan. 

[0108] Similarity of the candidate compounds with compounds known to activate or inhibit activation of HC may be 
structural or functional, or both. For example, candidate compounds to be screened may be chosen starting from their 
chemical resemblance to known compounds. Other candidate compounds may be chosen by taking into account their 
so functional relationship with the metalloprotease family. 

[0109] Other candidate compounds can be chosen among compounds known to be involved in a large number of 
biological processes and can be selected for screening solely on this basis. 

[0110] HC polypeptides are thought to be responsible for many biological functions, including many pathologies 
relating to an abnormal expression of HC. Accordingly, it is desirous to find compounds and drugs which stimulate HC 
55 polypeptide activity or its expression (agonists) on the one hand and which can inhibit HC polypeptide activity or its 
expression (antagonists, inhibitors) on the other hand, for therapeutic and prophylactic purposes for such conditions 
as Alzheimer's disease and cognitive disorders, developmental abnormalities and fetal deficiencies, neurodegenerative 
disorders such as amyotrophic lateral sclerosis, Parkinson's disease, schizophrenia, abnormal mental states, ischemic 
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shock, epilepsia. In general, such screening procedures may involve using appropriate cells which express HC polypep- 
tide or respond to HC polypeptide of the present invention. Such cells include cells from mammals, yeast, Drosophila 
or E. coll Cells which express HC polypeptide (or cell membrane containing the expressed polypeptide) or respond 
to HC polypeptide are then contacted with a test compound to observe binding, stimulation or inhibition of a functional 
response. 

[0111] The assays may simply test binding of a candidate compound, wherein adherence to the cells bearing the 
HC polypeptide is detected by means of a label directly or indirectly associated with the candidate compound or in an 
assay involving competition with a labeled competitor. Further, these assays may test whether the candidate compound 
results in a signal generated by activation of the HC polypeptide, using detection systems appropriate to the cells 
bearing the HC polypeptide. Inhibitors of activation are generally assayed in the presence of a known agonist and the 
effect on activation by the agonist by the presence of the candidate compound is observed. Standard methods for 
conducting such screening assays are well understood in the art. 

[0112] Examples of potential HC polypeptide antagonists include antibodies or, in some cases, oligonucleotides or 
proteins which are closely related to the ligands, substrates, receptors, etc, as the case may be, of the HC polypeptide, 
e.g., a fragment of the ligands, substrates, receptors, or small molecules which bind to the polypeptide of the present 
invention but do not elicit a response, so that the activity of the polypeptide is prevented. 

[0113] Thus, the invention also encompasses agonists or antagonists of HC polypeptide identifiable by the methods 
above described. 

[0114] The invention also provides methods of treating abnormal conditions related to both an over-activity or an 
insufficient activity of HC. 

[0115] In a first aspect, the invention provides a method of treating an individual in need of an altered, diminished 
or inhibited level of HC activity or expression, comprising administering to such an individual a pharmaceutical com- 
position comprising an effective amount of an active ingredient capable of altering or diminishing HC activity. 
[0116] Thus this first aspect of the invention includes the use of an active ingredient capable of altering or diminishing 
HC activity, for the preparation of a medicament for treatment of conditions related to an over-activity of HC. 
[0117] In that case, several approaches are available. One approach comprises administering as active ingredient 
an inhibitor compound (antagonist) identifiable by the method above described, optionally with a pharmaceutically 
acceptable carrier, in an amount effective to inhibit activation by blocking activity of HC, or by inhibiting a second signal, 
and.thereby alleviating the abnormal condition. 

[0118] In another approach, soluble forms of HC polypeptides still capable of binding the ligand in competition with 
endogenous HC may be administered as active ingredient. Typical embodiments of such competitors comprise frag- 
ments of the HC polypeptide. 

[0119] * In still another approach, expression of the gene encoding endogenous HC can be inhibited using expression 
blocking techniques. Such known techniques involve the use of antisense sequences, either internally generated or 
separately administered. See, for example, O'Connor, J Neurochem (1991) 56:560 in Oligodeoxynucleotides as Anti- 
sense Inhibitors of Gene Expression, CRC Press, Boca Raton, FL (1988). Alternatively, oligonucleotides which form 
triple helices with the gene can be supplied. See, for example, Lee et al, Nucleic Acids Res. (1979) 6:3073; Cooney 
era/., Science (1988) 241:456; Dervan et at., Science (1991) 251:1360. These oligomers can be administered perse 
or the relevant oligomers can be expressed in vivo. 

[0120] In a second aspect, the invention provides a method of treating an individual in need of an increased level of 
HC activity or expression, comprising administering to such an individual a pharmaceutical composition comprising an 
effective amount of an active ingredient capable of increasing HC activity. 

[0121] Thus the second aspect of the invention includes the use of an active ingredient capable of increasing HC 
activity, for the preparation of a medicament for treatment of conditions related to an under-activity of HC. 
[0122] In that case, several approaches are also available. One approach comprises administering as active ingre- 
dient a therapeutically effective amount of a compound which activates HC, i.e., an agonist identifiable by the method 
described above, in combination with a pharmaceutically acceptable carrier, to thereby alleviate the abnormal condition. 
[0123] Alternatively, gene therapy may be employed to effect the endogenous production of HC by the relevant cells 
in the subject. For example, a polynucleotide of the invention may be engineered for expression in a replication defective 
retroviral vector, as discussed above. The polynucleotide constitutes in that case the active ingredient. The retroviral 
expression construct may then be isolated and introduced into a packaging cell transduced with a retroviral plasmid 
vector containing RNA encoding a polypeptide of the present invention such that the packaging cell now produces 
infectious viral particles containing the gene of interest. These producer cells may be administered to a subject for 
engineering cells in vivo and expression of the polypeptide in vivo. For overview of gene therapy, see Chapter 20, Gene 
Therapy and other Molecular Genetic-based Therapeutic Approaches, (and references cited therein) in Human Mo- 
lecular Genetics, T Strachan and A P Read, BIOS Scientific Publishers Ltd (1996). 

[0124] Pharmaceutical compositions of the invention are intended, but not limited to, the treatment and/or prevention 
of Alzheimer's disease and cognitive disorders, developmental abnormalities and fetal deficiencies, neurodegenerative 
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disorders such as amyotrophic lateral sclerosis, Parkinson's disease, schizophrenia, abnormal mental states, ischemic 
shock, epilepsia, among others. 

[0125] In a first aspect, the pharmaceutical compositions of the invention for preventing and/or treating in a subject 
a disease or susceptibility to a disease related to over-activity or over-expression of HC may comprise a therapeutically 
effective amount of at least an antagonist of HC or a derivative or homologue thereof as defined above, and/or at least 
a nucleic acid molecule that inhibits the expression of the nucleotide sequence encoding HC (anti-sense sequence) 
and/or at least a peptide which competes with said receptor, as discussed above. 

[0126] In a second aspect, the pharmaceutical compositions of the invention for preventing and/or treating in a subject 
a disease or susceptibility to a disease related to under-activity or under-expression of HC may comprise a therapeu- 
tically effective amount of at least an agonist of HC or a derivative or homologue thereof as defined above, and/or at 
least a polynucleotide according to the invention and/or at least a nucleotide sequence according to the invention able 
to express in vivo a HC polypeptide of the invention. 

[0127] The pharmaceutical compositions in which at least a polypeptide according to the invention is used may 
comprise the recombinant vectors of the invention, able to express in vivo a polypeptide according to the invention. 
Polypeptides according to the invention may be also used "naked", i.e. without being inserted into a vector. It has been 
shown in the art that DNA or RNA sequences can be expressed in some tissues without the aid of an expression vector. 
[0128] Pharmaceutical compositions according to the invention, for example those which comprise polypeptides of 
the invention, such as the soluble form of HC polypeptides or small molecule agonists or antagonists, may be formulated 
in combination with a suitable pharmaceutical carrier. Such formulations comprise a therapeutically effective amount 
of the polypeptide or compound, and a pharmaceutical^ acceptable earner or excipient. Such carriers include but are 
not limited to saline, buffered saline, dextrose, water, glycerol, ethanol, and combinations thereof. Formulation should 
suit the mode of administration, and is well within the skill of the art. The invention further relates to pharmaceutical 
packs and kits comprising one or more containers filled with one or more of the ingredients of the aforementioned 
compositions of the invention. 

[0129] Polypeptides and other compounds of the present invention may be employed alone or in conjunction with 
other compounds, such as therapeutic compounds. 

[0130] Preferred forms of systemic administration of the pharmaceutical compositions include injection, typically by 
intravenous injection. Other injection routes, such as subcutaneous, intramuscular, intrasternal, intraarterial or intra- 
peritoneal, can be used. Alternative means for systemic administration include transmucosal and transdermal admin- 
istration using penetrants such as bile salts or fusidic acids or other detergents. In addition, if properly formulated in 
enteric or encapsulated formulations, oral, rectal, intravaginal administration may also be possible. Administration of 
these compounds may also be topical and/or localized, in the form of salves, pastes, gels and the like. 
- [0131] The dosage range required depends on the choice of peptide, the route of administration, the nature of the 
formulation, the: nature of the subject's condition, and the judgment of the attending practitioner. Suitable dosages, 
however, are in the range of 1 ug/kg/day to 10 mg/kg/day of subject. More preferably, this dose is at least between 
0.01 mg/kg/day and 1 mg/kg/day. Variations in the needed dosage are to be expected in view of the nature of the 
compounds used and the differing efficiencies of various routes of administration. For example, oral administration 
would be expected to require higher dosages than administration by intravenous injection. Variations in these dosage 
levels can be adjusted using standard empirical routines for optimization, as is well understood in the art. 
[0132] Polypeptides used in treatment can also be generated endogenously in the subject, in treatment modalities 
often referred to as "gene therapy" as described above. Thus, for example, cells from a subject may be engineered 
with a polynucleotide of the invention, such as a DNA or RNA, to encode a polypeptide ex vivo, and for example, by 
the use of a retroviral plasmid vector. The cells are then introduced into the subject. 

[0133] The invention further relates to the use of HC polypeptides of the invention, for the treatment of Alzheimer's 
disease and cognitive disorders, developmental abnormalities and fetal deficiencies, neurodegenerative disorders such 
as amyotrophic lateral sclerosis, Parkinson's disease, schizophrenia, abnormal mental states, ischemic shock, epilep- 
sia, among others. 

[0134] The invention also relates to the use of a HC polypeptide, for the preparation of a pharmaceutical composition 
for the treatment of Alzheimer's disease and cognitive disorders, developmental abnormalities and fetal deficiencies, 
neurodegenerative disorders such as amyotrophic lateral sclerosis, Parkinson's disease, schizophrenia, abnormal 
mental states, ischemic shock, epilepsia, among others. 

[0135] The examples below are carried out using standard techniques, which are well known and routine to those 
of skill in the art, except where otherwise described in detail. The examples illustrate, but do not limit, the invention. 
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Example 1 

Cloning of human carnosinase. 

[0136] The sequence of HC is first identified by searching a database containing approximately 2 million human 
ESTs, which was generated using high throughput automated DNA sequence analysis of randomly selected human 
cDNA clones (Adams, M. D. et a/., Nature 377:3-174 (1995); Adams, M. D. et a/., Nature 355:632-634 (1992); and 
Adams, M. D. et a/., Science 252:1651-1656 (1991)). Sequence homology comparisons of each EST are performed 
against the GenBank database using the blastn and tblastn algorithms (Altschul, S. F. et ah, J. Mol. Biol. 275:403-410 
(1990)). No homologous human sequences were identified. One homologous sequence - yeast hypothetical 52.9 KD 
protein - shows around 45 % similarity with HC. Electronic analysis of the distribution of the EST reveals its brain- 
specific localisation. Sequence of the gene is confirmed by double strand DNA sequencing and the gene is shown to 
be completely new by a blast search against Genbank. 

Example 2 

Cloning and Expression of HC in Mammalian Cells. 

[01 37] A typical mammalian expression vector contains the promoter element, which mediates the initiation of tran- 
scription of mRNA, the protein coding sequence, and signals required for the termination of transcription and polya- 
denylation of the transcript. Additional elements include enhancers, Kozak sequences and intervening sequences 
flanked by donor and acceptor sites for RNA splicing. Highly efficient transcription can be achieved with the early and 
late promoters from SV40, the long terminal repeats (LTRS) from Retroviruses, e.g., RSV, HTLVI, HIVI and the early 
promoter of the cytomegalovirus (CMV). However, cellular elements can also be used (e.g., the human actin promoter). 
Suitable expression vectors for use in practicing the present invention include, for example, vectors such as PSVL and 
PMSG (Pharmacia, Uppsala, Sweden), pRSVcat (ATCC 37152), pSV2dhfr (ATCC 37146) and pBC12MI (ATCC 67109) 
and pcDNA3 (Invitrogen). Mammalian host cells that could be used include, human HEK 293, H9 and Jurkat cells, 
mouse NIH3T3 and C127 cells, Cos 1, Cos 7 and mouse L cells and Chinese hamster ovary (C HO) cells. 
[0138] Alternatively, the gene can be expressed in stable cell lines that contain the gene integrated into a chromo- 
some. The co-transfection with a selectable marker such as dhfr, gpt, neomycin, zeocin or hygromycin allows the 
identification and isolation of the transfected cells. 

[0139] The transfected gene can also be amplified to express large amounts of the encoded protein. The DHFR 
:(dihydrofolate reductase) marker is useful to develop cell lines that carry several hundred or even several thousand 
copies of the gene of interest. Another useful.selection marker is the enzyme glutamine synthase (GS) (Murphy et ai, 
Biochem.J. 227:27.7-279 (1991); Bebbington et a/., Biol Technology 10A 69-1 75 (1992)). Using these markers, the 
mammalian cells are grown in selective medium and the cells with the highest resistance-are selected. These cell lines 
contain the amplified gene(s) integrated into a chromosome. Chinese hamster ovary (CHO) and NSO cells are often 
used for the production of proteins. 

[0140] The expression vector pcDNA3 (Invitrogen) contains the strong promoter (CMV) of the Cytomegalovirus. 
Multiple cloning sites allow the cloning of foreign genes. The entire coding region of HC is inserted into the expression 
vector pCDNA3 (+) by use of EcoR1 and Xhol restriction enzymes. The presence of insert and its orientation in the 
vector is detected by PCR with specific oligonucleotides. 

[0141] The expression vector carrying HC is used to transfect CHO cells, using standard protocols. Expression of 
HC polypeptide is detected by Western blot with anti-HC antibodies. Cell lysates and cell supernatants are analyzed. 

Example 3 

Tissue distribution of HC mRNA expression. 

[0142] Northern blot analysis can be carried out to examine HC gene expression in human tissues, using methods 
described by, among others, Sambrook et a/., cited above. A cDNA probe containing the entire nucleotide sequence 
of the HC protein can be labeled with 32 P using the Rediprime™ DNA labeling system (Amersham Life Science, Ar- 
lington, IL), according to manufacturer's instructions. After labeling, the probe can be purified using a CHROMA SPIN 
- 100™ column (Clontech Laboratories, Inc.), according to manufacturer's protocol number PT1200-1. The purified 
labeled probe is then used to examine various human tissues for HC mRNA. 
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MTN blot. 

[0143] Multiple Tissue Northern (MTN) blots containing various human tissues can be obtained from Clontech and 
examined with the labeled probe using ExpressHyb™ hybridization solution (Clontech) according to manufacturer's 
5 protocol number PT1190-1. Following hybridization and washing, the blots can be mounted and exposed to film at - 
70°C overnight, and films developed according to standard procedures. 

Master blot. 

10 [0144] Tissue distribution of HC is determined by hybridization of the cDNA probe obtained from the HC clone by 
Ecori-Xhol digestion. The probe is radiolabeled with [32P]dCTP. This probe is used to hybridize a Human RNA Master 
Dot blot (Clontech, ref 7770-1 ) with 50 different tissues represented (1 00-500 ng / lane) in Express Hyb buffer (Clontech) 
according to the manufacturer specifications. The signal is detected using a Storm (Molecular Dynamics). Results 
show that HC mRNA is only expressed in CNS and liver (Figure 1). 

15 

Example 4 

Functional activity of Human Carnosinase by in vitro measurements. 

20 [0145] Experiments are performed on transiently transfected CHO cells obtained using standard techniques. Two 
protocols are applied : 1) tissue carnosinase, and 2) plasma carnosinase. Both protocols give comparable results when 
applied to transiently transfected cells. 

[0146] Cells transfected with HC cDNA are diluted in 1 0mM Tris-HCI pH 8.0. Cytosol is used for activity measurement 
in presence or absence of 0.25 mM MnCI 2 , sonicated 3 times 10 sec. on ice and centrifuged 15,000 g / 10 min. at 4°C. 

25 Enzymatic assay is performed in black 96-well plates. 5 uJ of sample are mixed with 5 uJ of buffer A (125 mM glycine 
pH 9.5; 0.25 mM MnCI 2 in 10 mM Tris-HCI pH 8.0, 10 mM dithiothreitol) and incubated at room temperature for 15 
min. 5 of substate (50 mM carnosine in 125 mM glycine, pH 9.5) are added and incubated at 30° C for 60 min. The 
reaction is stopped by precipitation with 15 pi of 0.6N TCA. Measurement of Histidine is performed after precipitation 
with 30 \i\ of supernatant by addition of 1 00 uJ of 1 N NaOH and 50 \l\ of OPT (50 mg o-phthalaldehyde / 1 00 ml ethanol 

30 95%), incubation at 30°C for 15 min., addition of 50 \i\ of buffer B (56.36 g P 2 0 7 K 4 .10H 2 O + 1.65 g HNa 2 0 3 S/1 liter 
8N phosphoric acid), incubation at 30°C for 15 min. Finally, the plate is agitated and reading of the fluorescence is 
performed at room temperature (excitation 360 nm, emission 460 nm). Results appear on Figure 2. Carnosinase activity 
* - is found-exclusively in the medium of transfected cells. Two protocols are tested with cells grown in two different 

mediums : serum-free (MSSL) medium or medium supplied with fetal serum (tSVF).Jn both conditions, high activity 

35 of carnosinase is found. . 

Example 5. - - 

Expression of Human Carnosinase in Alzheimer's disease. 

[0147] Analysis of the level of expression of HC in patients with Alzheimer's disease is performed by Northern blot 
and quantitative PCR using standard protocols. Commercially available blots with total RNAs from different cerebral 
structures of Alzheimer and control patients (Invitrogen) are hybridized with HC cDNA probe. For quantitative PCR 
mRNAs are isolated from temporal cortex of control and Alzheimer patients. After the reaction of reverse transcription, 
45 quantitative PCR is done using commercially available kit from Clontech. Results demonstrate a 2.5 fold increase in 
mRNA level for HC in patients with Alzheimer's disease. 

[0148] It will be clear that the invention may be practiced otherwise than as particularly described in the foregoing 
description and examples. 

[0149] Numerous modifications and variations of the present invention are possible in light of the above teachings 
so and, therefore, are within the scope of the appended claims. 

[0150] The entire disclosure of all publications (including patents, patent applications, journal articles, laboratory 
manuals, books, or other documents) cited herein are incorporated by reference. 
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SEQUENCE LISTING 

<110> SANOFI-SYNTHELABO 
5 <120> Human carnosinase, its isolation and uses. 

<130> SANSYL010 
<160> 2 

™ <170> Patentln Ver. 2.1 

<210> 1 

<211> 1524 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . (1524) 

<400> 1 

atg gat ccc aaa etc ggg aga atg get gcg tec ctg ctg get gtg ctg 48 

Met Asp Pro Lys Leu Gly Arg Met Ala Ala Ser Leu Leu Ala Val Leu 
15 10 15 

ctg ctg ctg ctg ctg gag cgc ggc atg ttc tec tea ccc tec ccg ccc 96 
Leu Leu Leu Leu Leu Glu Arg Gly Met Phe Ser Ser Pro Ser Pro Pro 
20 25 30 

ccg gcg ctg tta gag aaa gtc ttc cag tac att gac etc cat cag gat 14 4 
Pro Ala Leu Leu Glu Lys Val Phe Gin Tyr lie Asp Leu His Gin Asp 
35 40 45 

gaa ttt gtg cag acg ctg aag gag tgg gtg gee ate gag age gac tct 192 
Glu Phe Val Gin Thr Leu Lys Glu Trp Val Ala He Glu Ser Asp Ser 

50 55 60 

gtc cag- cct- gtg cct cgc ttc aga caa gag etc ttc aga atg "atg gee- 240 - 
Val Gin Pro Val Pro Arg Phe Arg Gin Glu Leu -Phe Arg Met Met Ala 
. 65 70 75 80 

gtg get gcg gac acg ctg cag cgc ctg ggg gee cgt gtg gee teg gtg 288 
Val Ala Ala Asp Thr Leu Gin Arg Leu Gly Ala Arg Val Ala Ser Val 
85 90 95 

- gac atg ggt cct cag cag ctg ccc gat ggt cag agt ctt cca ata cct 336 
Asp Met Gly Pro Gin Gin Leu Pro Asp Gly Gin Ser Leu Pro He Pro 
100 105 110 

ccc gtc ate ctg gee gaa ctg ggg age gat ccc acg aaa ggc ace gtg 384 
Pro Val He Leu Ala Glu Leu Gly Ser Asp Pro Thr Lys Gly Thr Val 
115 120 125 

tgc ttc tac ggc cac ttg gac gtg cag cct get gac egg ggc gat ggg 4 32 
Cys Phe Tyr Gly His Leu Asp Val Gin Pro Ala Asp Arg Gly Asp Gly 
130 135 140 

tgg etc acg gac ccc tat gtg ctg acg gag gta gac ggg aaa ctt tat 480 
Trp Leu Thr Asp Pro Tyr Val Leu Thr Glu Val Asp Gly Lys Leu Tyr 
145 150 155 160 

55 gga cga gga gcg ace gac aac aaa ggc cct gtc ttg get tgg ate aat 528 

Gly Arg Gly Ala Thr Asp Asn Lys Gly Pro Val Leu Ala Trp He Asn 



20 



25 



30 



35 



40 



45 



50 



17 



EP 1 097 997 A1 



165 170 175 

get gtg age gec ttc aga gec ctg gag caa gat ctt cct gtg aat ate 576 
Ala Val Ser Ala Phe Arg Ala Leu Glu Gin Asp Leu Pro Val Asn lie 
5 180 185 190 

aaa ttc ate att gag ggg atg gaa gag get ggc tct gtt gee ctg gag 624 
Lys Phe lie He Glu Gly Met Glu Glu Ala Gly Ser Val Ala Leu Glu 
195 200 205 

10 gaa ctt gtg gaa aaa gaa aag gac cga ttc ttc tct ggt gtg gac tac 672 

Glu Leu Val Glu Lys Glu Lys Asp Arg Phe Phe Ser Gly Val. Asp Tyr 
210 215 220 

att gta att tea gat aac ctg tgg ate age caa agg aag cca gca ate 720 
ile Val He Ser Asp Asn Leu Trp He Ser Gin Arg Lys Pro Ala He 
15 225 230 235 240 

act tat gga acc egg ggg aac age tac ttc atg gtg gag gtg aaa tgc 768 

Thr Tyr Gly Thr Arg Gly Asn Ser Tyr Phe Met Val Glu Val Lys Cys 

245 250 255 

nn 

aga gac cag gat ttt cac tea gga acc ttt ggt ggc ate ctt cat gaa 816 

Arg Asp Gin Asp Phe His Ser Gly Thr Phe Gly Gly He Leu His Glu 

260 265 270 

cca atg get gat ctg gtt get ctt etc ggt age ctg gta gac teg tct 864 
Pro Met Ala Asp Leu Val Ala Leu Leu Gly Ser Leu Val Asp Ser Ser 
275 280 285 

ggt cat ate ctg gtc cct gga ate tat gat gaa gtg gtt cct ctt aca 912 
Gly His Ile Leu Val Pro Gly He Tyr Asp Glu Val Val Pro Leu Thr 
290 295 300 

gaa gag gaa ata aat aca tac aaa gee ate cat eta gac eta gaa gaa 960 
Glu Glu Glu Ile Asn Thr Tyr Lys Ala Ile His Leu Asp Leu Glu Glu 
305 310 ' 315 320 

tac egg aat age age egg gtt gag aaa ttt ctg ttc 'gat act aag gag 1008 
35 ' Tyr Arg Asn Ser Ser Arg Val Glu Lys Phe Leu Phe Asp Thr Lys Glu 

325 330 335 

gag att eta atg cac etc tgg agg tac cca tct ctt tct att cat ggg 1056 

Glu Ile Leu Met His Leu Trp Arg Tyr Pro Ser Leu Ser Ile His Gly 

340 345 350 

40 

ate gag ggc gcg ttt gat gag cct gga act aaa aca gtc ata cct ggc 1104 

Ile Glu Gly Ala Phe Asp Glu Pro Gly Thr Lys Thr Val Ile Pro Gly 

355 360 365 

cga gtt ata gga aaa ttt tea ate cgt eta gtc cct cac atg aat gtg 1152 
Arg Val Ile Gly Lys Phe Ser Ile Arg Leu Val Pro His Met Asn Val 
370 375 380 

tct gcg gtg gaa aaa cag gtg aca cga cat ctt gaa gat gtg ttc tec 1200 
Ser Ala Val Glu Lys Gin Val Thr Arg His Leu Glu Asp Val Phe Ser 
385 390 395 400 

aaa aga aat agt tec aac aag atg gtt gtt tec atg act eta gga eta 1248 
Lys Arg Asn Ser Ser Asn Lys Met Val Val Ser Met Thr Leu Gly Leu 
405 " 410 415 

55 cac ccg tgg att gca aat att gat gac acc cag tat etc gca gca aaa 1296 

His Pro Trp Ile Ala Asn Ile Asp Asp Thr Gin Tyr Leu Ala Ala Lys 
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420 425 430 

aga gcg ate aga aca gtg ttt gga aca gaa cca gat atg ate egg gat 

Arg Ala He Arg Thr Val Phe Gly Thr Glu Pro Asp Met He Arg Asp 

435 440 445 

gga tec ace att cca att gee aaa atg ttc cag gag ate gtc cac aag 

Gly Ser Thr He Pro He Ala Lys Met Phe Gin Glu He Val His Lys 

450 455 460 

age gtg gtg eta att ccg ttg gga get gtt gat gat gga gaa cat teg 

Ser Val Val Leu He Pro Leu Gly Ala Val Asp Asp Gly Glu His Ser 

465 470 475 480 

cag aat gag aaa ate aac agg tgg aac tac ata gag gga acc aaa tta 

Gin Asn Glu Lys He Asn Arg Trp Asn Tyr He Glu Gly Thr Lys Leu 

485 490 495 

ttt get -gee ttt ttc tta gag atg gee cag etc cat 

Phe Ala Ala Phe Phe Leu Glu Met Ala Gin Leu His 
500 505 



<210> 2 
<211> 508 
<212> PRT 

<213> Homo sapiens 
<400> 2 

Met Asp Pro Lys Leu Gly Arg Met Ala Ala Ser Leu Leu Ala Val Leu 
15 10 15 

Leu Leu Leu Leu Leu Glu Arg Gly Met Phe Ser Ser Pro Ser Pro Pro 
20 25 30 

Pro Ala Leu Leu Glu Lys Val Phe Gin Tyr He Asp Leu His Gin Asp 
35 40 45 

Glu Phe Val Gin Thr Leu Lys Glu Trp Val Ala- He -Glu Ser Asp Ser 
50 55 60 

Val Gin Pro Val Pro Arg Phe Arg Gin Glu Leu Phe Arg Met Met Ala 
65 70 75 80 

Val Ala Ala Asp Thr Leu Gin Arg Leu Gly Ala Arg Val Ala Ser Val 
85 90 95 

Asp Met Gly Pro Gin Gin Leu Pro Asp Gly Gin Ser Leu Pro He Pro 
100 105 110 

Pro Val lie Leu Ala Glu Leu Gly Ser Asp Pro Thr Lys Gly Thr Val 
115. 120 125 

Cys Phe Tyr Gly His. Leu Asp Val Gin Pro Ala Asp Arg Gly Asp Gly 
130 135 140 

Trp Leu Thr Asp Pro Tyr Val Leu Thr Glu Val Asp Gly Lys Leu Tyr 
145 150 155 160 

Gly Arg Gly Ala Thr Asp Asn Lys Gly Pro Val Leu Ala Trp lie Asn 
165 170 175 

Ala Val Ser Ala Phe Arg Ala Leu Glu Gin Asp Leu Pro Val Asn lie 
180 185 190 
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Lys Phe He He Glu Gly Met Glu Glu Ala Gly Ser Val Ala Leu Glu 
195 200 205 

Glu Leu Val Glu Lys Glu Lys Asp Arg Phe Phe Ser Gly Val Asp Tyr 
210 215 220 

He Val He Ser Asp Asn Leu Trp He Ser Gin Arg Lys Pro Ala He 
225 230 235 240 

Thr Tyr Gly Thr Arg Gly Asn Ser Tyr Phe Met Val Glu Val Lys Cys 
245 250 255 

Arg Asp Gin Asp Phe His Ser Gly Thr Phe Gly Gly He Leu His Glu 
260 265 270 

Pro Met Ala Asp Leu Val Ala Leu Leu Gly Ser Leu Val Asp Ser Ser 
275 280 285 

Gly His He Leu Val Pro Gly He Tyr Asp Glu Val Val Pro Leu Thr 
290 295 300 

Glu Glu Glu He Asn Thr Tyr Lys Ala He His Leu Asp Leu Glu Glu 
305 310 315 320 

Tyr Arg Asn Ser Ser Arg Val Glu Lys Phe Leu Phe Asp Thr Lys Glu 
325 330 335 

Glu He Leu Met His Leu Trp Arg Tyr Pro Ser Leu Ser He His Gly 
340 345 350 

He Glu Gly Ala Phe Asp Glu Pro Gly Thr Lys Thr Val He Pro Gly 
355 360 365 

Arg Val He Gly Lys Phe Ser He Arg Leu Val Pro His Met Asn Val 
370 375 380 

Ser Ala Val Glu Lys Gin Val Thr Arg His Leu Glu Asp Val Phe Ser 
385 390 395 400 

Lys Arg Asn Ser Ser Asn Lys Met Val Val Ser Met Thr Leu Gly Leu 
405 " 410 415 

His Pro Trp He Ala Asn He Asp Asp Thr Gin Tyr Leu Ala Ala Lys 
420 425 430 

Arg Ala He Arg Thr Val Phe Gly Thr Glu Pro Asp Met He Arg Asp 
435 440 445 

Gly Ser Thr He Pro He Ala Lys Met Phe Gin Glu He Val His Lys 
450 455 460 

Ser Val Val Leu lie Pro Leu Gly Ala Val Asp Asp Gly Glu His Ser 
465 470 475 480 

Gin Asn Glu Lys He Asn Arg Trp Asn Tyr lie Glu Gly Thr Lys Leu 
485 490 " 495 



Phe Ala Ala Phe Phe Leu Glu Met Ala Gin Leu His 
500 505 
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Claims 

1. An isolated polynucleotide comprising a nucleotide sequence at least 80% identical to a nucleotide sequence 
selected from the group consisting of : 

(a) the nucleotide sequence set forth in SEQ ID NO: 1; 

(b) a nucleotide sequence encoding the HC polypeptide set forth in SEQ ID NO:2; 

(c) the nucleotide sequence comprising the sequence located between nucleotide numbers 1 to 1524 set forth 
in SEQ ID NO: 1; 

(d) the nucleotide sequence encoding a polypeptide comprising amino acids from about 255 to about 272 set 
forth in SEQ ID NO:2; 

(e) the nucleotide sequence encoding a polypeptide comprising amino acids from about 475 to about 490 set 
forth in SEQ ID NO:2; 

(f) a nucleotide sequence encoding the HC polypeptide expressed by the cDNA clone contained in ATCC 
deposit number PTA-541; 

(g) a nucleotide sequence able to hybridize with the nucleotide sequence set forth in SEQ ID NO:1 or the 
cDNA clone contained in ATCC deposit number PTA-541 ; 

(h) a nucleotide sequence encoding the mature HC polypeptide expressed by the cDNA clone contained in 
ATCC deposit number PTA-541 ; 

(i) a nucleotide sequence complementary to any of the nucleotide sequences defined in (a), (b), (c), (d), (e), 
(f), (g) or (h) above, as well as fragments, derivatives, variants, mutated forms of these nucleotide sequences. 

2. Polynucleotide according to claim 1 , wherein it comprises the nucleotide sequence located between nucleotide 
number 1 and nucleotide number 500, or the nucleotide sequence located between nucleotide number 501 and 
nucleotide number 1 524 of the sequence set forth in SEQ ID NO:1 . 

3. A DNA or RNA molecule constituting a recombinant vector, comprising an expression system which is capable of 
producing a HC polypeptide when said expression system is present in a compatible host, wherein said DNA or 
RNA molecule comprises a polynucleotide according to claim 1 or 2. 

4. A host cell comprising the expression system of claim 3. 

5. A process for producing a HC polypeptide comprising the steps of transforming or transfecting a host cell with an 
expression vector according to claim 3, culturing said host cell under appropriate conditions so that said host cell 
produces HC polypeptide, and recovering the polypeptide from the culture. 

6. An isolated HC polypeptide comprising an amino acid sequence which is at least 80% identical to a sequence 
selected from the group consisting of : 

(a) the amino acid sequence of SEQ ID NO:2; 

(b) the amino acid sequence of HC polypeptide having the complete amino acid sequence encoded by the 
cDNA clone contained in ATCC deposit number PTA-541; 

(c) the amino acid sequence of the mature HC polypeptide having the complete amino acid sequence encoded 
by the cDNA clone contained in ATCC deposit number PTA-541; as well as fragments, derivatives, variants 
or mutated forms of these amino acid sequences. 

7. Polypeptide according to claim 6, susceptible to be obtained by the process of claim 5 or by chemical synthesis. 

8. An antibody, or a fragment thereof, able to recognize a HC polypeptide according to claim 6. 

9. Antibody according to claim 8 able to recognize a polypeptide comprising amino acid residues from about 255 to 
about 272 of SEQ ID NO:2, a polypeptide comprising amino acid residues from about 475 to about 490 of SEQ 
ID NO:2. 

10. A method for identifying agonists of HC polypeptide comprising the steps of: 

(a) contacting a host cell according to claim 4 with a candidate compound; and 

(b) determining whether the candidate compound effects a signal generated by activation of HC polypeptide. 
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11. An agonist of HC polypeptide identifiable by the method of claim 10. 

12. An agonist according to claim 11, wherein the candidate compound is selected among compounds which present 
a structural or functional, or both, similarity with known agonists of carnosinase or of the metal loprotease family, 

5 or compounds which are known to be involved in a large number of biological processes. 

13. A method for identifying antagonists of HC polypeptide comprising the steps of : 

(a) contacting a host cell according to claim 4 with an agonist of HC polypeptide, and 
10 (b) determining whether the signal generated by said agonist is diminished in the presence of a candidate 

compound. 

14. An antagonist of HC polypeptide identifiable by the method of claim 13. 

15 15. Antagonist according to claim 14, wherein the candidate compound is selected among compounds which present 
a structural or functional, or both, similarity with known antagonists of carnosinase or of the metal loprotease family, 
or compounds which are known to be involved in a large number of biological processes. 

16. A process for diagnosing a disease or a susceptibility to a disease in a subject related to expression or activity of 
20 a HC polypeptide according to claim 6 in a subject comprising: 

(a) determining the presence or absence of a mutation in the nucleotide sequence encoding said HC polypep- 
tide in the genome of said subject; and/or 

(b) analyzing for the presence or amount of said HC polypeptide expression in a sample derived from said 
25 subject. 

17. Pharmaceutical composition, for preventing and/or treating in a subject a disease or susceptibility to a disease 
related to an under-expression or under-activity of HC polypeptide, said composition comprising a therapeutically 
effective amount of at least an agonist of HC polypeptide or a derivative or homologue thereof identified by the 

30 method according to claim 10, and/or at least a HC polypeptide according to claim 6, and/or at least a nucleotide 

sequence able to express in vivo a HC polypeptide according to claim 6. 

18. Pharmaceutical composition for preventing and/or treating in a subject a disease or susceptibility to a disease 
. - - - - related to over-activity or over-expression of HC polypeptide, said composition comprising a therapeutically effec- 
ts tive amount of at least an antagonist of HC polypeptide or a derivative or homologue thereof identified by the 

method according to claim 13, and/or at least a nucleic acid molecule that inhibits the expression of the nucleotide 
sequence encoding HC polypeptide and/or at least a peptide. which competes with HC polypeptide. 

19. Pharmaceutical composition according to claim 17 or 18, comprising a recombinant vector according to claim 3, 
40 able to express in vivo a polypeptide according to claim 6. 

20. Pharmaceutical composition according to any of claims 17 to 19, further comprising at least a pharmaceutically 
acceptable carrier. 

45 21. Pharmaceutical composition according to any of claims 17 to 20 for prevention and/or treatment of Alzheimer's 
disease and cognitive disorders, developmental abnormalities and fetal deficiencies, neurodegenerative disorders 
such as amyotrophic lateral sclerosis, Parkinson's disease, schizophrenia, abnormal mental states, ischemic 
shock, epilepsia, among others. 

50 22. Use of a HC polypeptide according to claim 6, for the treatment of Alzheimer's disease and cognitive disorders, 
developmental abnormalities and fetal deficiencies, neurodegenerative disorders such as amyotrophic lateral scle- 
rosis, Parkinson's disease, schizophrenia, abnormal mental states, ischemic shock, epilepsia, among others. 

23. Use of a HC polypeptide according to claim 6, for the preparation of a pharmaceutical composition for the treatment 
55 of Alzheimer's disease and cognitive disorders, developmental abnormalities and fetal deficiencies, neurodegen- 

erative disorders such as amyotrophic lateral sclerosis, Parkinson's disease, schizophrenia, abnormal mental 
states, ischemic shock, epilepsia, among others. 
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